Active 3D Segmentation through Fixation of Previously Unseen Objects
نویسندگان
چکیده
We present a method for active object segmentation based on integration of several cues; image point positions, binocular disparities and pixel colours. It serves as a framework for generation of object hypotheses of previously unseen objects in natural indoor scenes. The appearance, 3D shape and size of objects are modelled in an iterative manner using an approximate Expectation-Maximisation (EM) method, that takes the dependencies between neighbouring pixel labels into consideration, unlike typical methods that assume neighbouring pixels to be independent. To better cope with situations when an object is hard to segregate from the surface it is placed on, possibly due to ambiguities in appearence, we propose a flat surface model as a complement to the two models typically used in figure-ground segmentation. A flat surface assumption is reasonable, given that most objects in indoor scenes are placed on flat, or at least locally flat, surfaces. We will show that even if no such physical plane exists in the scene, foreground segmentation succeeds anyway, since the flat surface model will become just another background model. We further let the segmentation evolve over time, this in order to provide more information and gradually improve segmentation and to facilitate tracking. Figure-ground segmentation is done using three different models, each described by a set of parameters; the foreground model θ f , the background model θb and that of the flat surface θs. Each pixel has an associated label li ∈ {l f , lb, ls}, depending on which component it belongs to. The model parameters θ = θ f ∪ θb ∪ θs and the labellings of all pixels l = {li} are unknown and estimated from the measurements m = {mi} at each pixel. With EM the maximum likelihood estimate of the model parameters θ is computed iteratively in two steps. In the first step (E-step) the conditional distribution w(l) = P(l|m,θ ′) is computed using the current estimate θ ′ and in the second step (M-step) a new estimate is found by maximising Q(θ |θ ′) = ∑l w(l) logP(m, l|θ). Unfortunately, since this summation is done over N3 different labellings, where N is the number of pixels, it quickly becomes prohibitly expensive. To make it computationally tractable we replace w(l) with the product of the conditional marginals for each unobserved label, w(li) = P(li|m,θ ′). Since a measurement mi depends only on its associated label li, the second step becomes a maximisation of
منابع مشابه
A Hybrid 3D Colon Segmentation Method Using Modified Geometric Deformable Models
Introduction: Nowadays virtual colonoscopy has become a reliable and efficient method of detecting primary stages of colon cancer such as polyp detection. One of the most important and crucial stages of virtual colonoscopy is colon segmentation because an incorrect segmentation may lead to a misdiagnosis. Materials and Methods: In this work, a hybrid method based on Geometric Deformable Models...
متن کاملA Semi-Automated Algorithm for Segmentation of the Left Atrial Appendage Landing Zone: Application in Left Atrial Appendage Occlusion Procedures
Background: Mechanical occlusion of the Left atrial appendage (LAA) using a purpose-built device has emerged as an effective prophylactic treatment in patients with atrial fibrillation at risk of stroke and a contraindication for anticoagulation. A crucial step in procedural planning is the choice of the device size. This is currently based on the manual analysis of the “Device Landing Zone” fr...
متن کامل3D Active Shape Model Segmentation with Nonlinear Shape Priors
The Active Shape Model (ASM) is a segmentation algorithm which uses a Statistical Shape Model (SSM) to constrain segmentations to 'plausible' shapes. This makes it possible to robustly segment organs with low contrast to adjacent structures. The standard SSM assumes that shapes are Gaussian distributed, which implies that unseen shapes can be expressed by linear combinations of the training sha...
متن کاملInteractive Medical Image Segmentation using Deep Learning with Image-specific Fine-tuning
Convolutional neural networks (CNNs) have achieved state-of-the-art performance for automatic medical image segmentation. However, they have not demonstrated sufficiently accurate and robust results for clinical use. In addition, they are limited by the lack of image-specific adaptation and the lack of generalizability to previously unseen object classes. To address these problems, we propose a...
متن کاملPrecise 3D Reconstruction of Cultural Objects Using Combined Multi-component Image Matching and Active Contours Segmentation
Cultural and creative industries constitute a large range of economic activities. Towards this expansion we need to state the inclusion of ICT technologies, as such of 3D reconstruction methods. However, precise 3D reconstruction under a computationally affordable manner is a research challenge. One way to precisely reconstruct a cultural object is through the use of photogrammetry with the mai...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010